Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 45000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 15.0 MiB |
| Average record size in memory | 348.8 B |
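The two memory figures in this table can be cross-checked against each other. A minimal sketch, assuming the report uses binary units (1 MiB = 2^20 bytes) and that the total is simply the average record size multiplied by the number of observations:

```python
# Figures copied from the "Dataset statistics" table.
N_ROWS = 45_000
AVG_RECORD_BYTES = 348.8  # "Average record size in memory"

BYTES_PER_MIB = 2 ** 20  # assumption: MiB is the binary unit

total_mib = N_ROWS * AVG_RECORD_BYTES / BYTES_PER_MIB
print(f"{total_mib:.1f} MiB")
```

Under those assumptions the result rounds to the 15.0 MiB shown for "Total size in memory".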
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
| Boolean | 1 |
Alerts
| Alert | Type |
|---|---|
| cb_person_cred_hist_length is highly correlated overall with person_age and person_emp_exp | High correlation |
| loan_amnt is highly correlated overall with loan_percent_income | High correlation |
| loan_percent_income is highly correlated overall with loan_amnt | High correlation |
| loan_status is highly correlated overall with previous_loan_defaults_on_file | High correlation |
| person_age is highly correlated overall with cb_person_cred_hist_length and person_emp_exp | High correlation |
| person_emp_exp is highly correlated overall with cb_person_cred_hist_length and person_age | High correlation |
| previous_loan_defaults_on_file is highly correlated overall with loan_status | High correlation |
| person_income is highly skewed (γ1 = 34.13758313) | Skewed |
| person_emp_exp has 9566 (21.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-19 20:07:50.162912 |
|---|---|
| Analysis finished | 2024-12-19 20:07:53.235966 |
| Duration | 3.07 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
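A report like this one can be regenerated with a few lines of Python. The sketch below assumes the data lives in a CSV file; the function name, file paths and report title are hypothetical, and the settings stored in config.json are not applied here, so the output may differ in detail from this report.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module loads even if the packages are missing.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings; the original run's config.json is not reproduced.
    profile = ProfileReport(df, title="Loan data profile")
    profile.to_file(out_path)
```

Calling `build_report("loan_data.csv")` (a placeholder file name) would write `report.html` next to the script. Installing `ydata-profiling==4.12.1` keeps the software version aligned with the table above.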
Variables
person_age
Real number (ℝ)
High correlation 
| Distinct | 60 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.764178 |
| Minimum | 20 |
| Maximum | 144 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 24 |
| median | 26 |
| Q3 | 30 |
| 95-th percentile | 39 |
| Maximum | 144 |
| Range | 124 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 6.0451082 |
|---|---|
| Coefficient of variation (CV) | 0.2177305 |
| Kurtosis | 18.649449 |
| Mean | 27.764178 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.548154 |
| Sum | 1249388 |
| Variance | 36.543333 |
| Monotonicity | Not monotonic |
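Two of the descriptive statistics follow directly from the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A small check using the person_age figures (the tolerance reflects the rounding of the printed values):

```python
# Figures copied from the person_age "Descriptive statistics" table.
mean = 27.764178
std = 6.0451082

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation

print(f"CV = {cv:.7f}, variance = {variance:.4f}")
```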
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 5254 | 11.7% |
| 24 | 5138 | 11.4% |
| 25 | 4507 | 10.0% |
| 22 | 4236 | 9.4% |
| 26 | 3659 | 8.1% |
| 27 | 3095 | 6.9% |
| 28 | 2728 | 6.1% |
| 29 | 2455 | 5.5% |
| 30 | 2021 | 4.5% |
| 31 | 1645 | 3.7% |
| Other values (50) | 10262 | 22.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 17 | < 0.1% |
| 21 | 1289 | 2.9% |
| 22 | 4236 | 9.4% |
| 23 | 5254 | 11.7% |
| 24 | 5138 | 11.4% |
| 25 | 4507 | 10.0% |
| 26 | 3659 | 8.1% |
| 27 | 3095 | 6.9% |
| 28 | 2728 | 6.1% |
| 29 | 2455 | 5.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 144 | 3 | < 0.1% |
| 123 | 2 | < 0.1% |
| 116 | 1 | < 0.1% |
| 109 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 84 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 73 | 3 | < 0.1% |
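Ages such as 144 or 123 are implausible and stand out against the quartiles reported for person_age. One common way to flag such values is Tukey's 1.5 × IQR rule. This rule is not something the report itself applies; it is sketched here only as an illustration, using the reported Q1 and Q3.

```python
# Quartiles from the person_age "Quantile statistics" table.
Q1, Q3 = 24, 30
iqr = Q3 - Q1

# Tukey fences: values outside [lower_fence, upper_fence] are flagged.
lower_fence = Q1 - 1.5 * iqr
upper_fence = Q3 + 1.5 * iqr

# The ten largest ages listed in the report.
largest_ages = [144, 123, 116, 109, 94, 84, 80, 78, 76, 73]
flagged = [age for age in largest_ages if age > upper_fence]

print(f"fences: [{lower_fence}, {upper_fence}], flagged: {flagged}")
```

The upper fence coincides with the reported 95th percentile (39), so this rule would flag at most about 5% of the rows. A domain-based cap on age may be a better choice than a purely statistical one.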
person_gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| male | 24841 |
|---|---|
| female | 20159 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.8959556 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | female |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| male | 24841 | 55.2% |
| female | 20159 | 44.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| male | 24841 | 55.2% |
| female | 20159 | 44.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 65159 | 29.6% |
| m | 45000 | 20.4% |
| a | 45000 | 20.4% |
| l | 45000 | 20.4% |
| f | 20159 | 9.1% |
person_education
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| Bachelor | 13399 |
|---|---|
| Associate | 12028 |
| High School | 11972 |
| Master | 6980 |
| Doctorate | 621 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.769 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Master |
|---|---|
| 2nd row | High School |
| 3rd row | High School |
| 4th row | Bachelor |
| 5th row | Master |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Bachelor | 13399 | 29.8% |
| Associate | 12028 | 26.7% |
| High School | 11972 | 26.6% |
| Master | 6980 | 15.5% |
| Doctorate | 621 | 1.4% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| bachelor | 13399 | 23.5% |
| associate | 12028 | 21.1% |
| high | 11972 | 21.0% |
| school | 11972 | 21.0% |
| master | 6980 | 12.3% |
| doctorate | 621 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 50613 | 12.8% |
| c | 38020 | 9.6% |
| h | 37343 | 9.5% |
| e | 33028 | 8.4% |
| a | 33028 | 8.4% |
| s | 31036 | 7.9% |
| l | 25371 | 6.4% |
| i | 24000 | 6.1% |
| r | 21000 | 5.3% |
| t | 20250 | 5.1% |
| Other values (8) | 80916 | 20.5% |
person_income
Real number (ℝ)
Skewed 
| Distinct | 33989 |
|---|---|
| Distinct (%) | 75.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80319.053 |
| Minimum | 8000 |
| Maximum | 7200766 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 8000 |
|---|---|
| 5-th percentile | 28366.7 |
| Q1 | 47204 |
| median | 67048 |
| Q3 | 95789.25 |
| 95-th percentile | 166754.7 |
| Maximum | 7200766 |
| Range | 7192766 |
| Interquartile range (IQR) | 48585.25 |
Descriptive statistics
| Standard deviation | 80422.499 |
|---|---|
| Coefficient of variation (CV) | 1.0012879 |
| Kurtosis | 2398.6848 |
| Mean | 80319.053 |
| Median Absolute Deviation (MAD) | 23124 |
| Skewness | 34.137583 |
| Sum | 3.6143574 × 10⁹ |
| Variance | 6.4677783 × 10⁹ |
| Monotonicity | Not monotonic |
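The skewness of 34.1 reported for person_income indicates an extremely long right tail. As an illustration of how such a figure is computed, the sketch below implements the adjusted Fisher-Pearson estimator, which is the bias-corrected form used by pandas' `Series.skew`. Whether the report relies on exactly this estimator is an assumption, and the sample values are hypothetical toy data, not rows from the dataset.

```python
import math


def adjusted_skewness(values):
    """Adjusted Fisher-Pearson sample skewness (bias-corrected)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    # Biased central moments of order 2 and 3.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


symmetric = [1, 2, 3, 4, 5]        # hypothetical toy data
right_tailed = [1, 2, 3, 4, 50]    # one large value stretches the right tail

print(adjusted_skewness(symmetric), adjusted_skewness(right_tailed))
```

A symmetric sample gives a value of zero, while a single large observation pushes the estimate well above zero. The same mechanism, driven by a handful of incomes in the millions, would explain the value in the table above.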
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 8000 | 15 | < 0.1% |
| 73011 | 10 | < 0.1% |
| 36995 | 9 | < 0.1% |
| 60914 | 8 | < 0.1% |
| 37020 | 8 | < 0.1% |
| 73082 | 7 | < 0.1% |
| 60864 | 7 | < 0.1% |
| 67131 | 7 | < 0.1% |
| 72951 | 7 | < 0.1% |
| 73040 | 7 | < 0.1% |
| Other values (33979) | 44915 | 99.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8000 | 15 | < 0.1% |
| 8037 | 1 | < 0.1% |
| 8104 | 1 | < 0.1% |
| 8186 | 1 | < 0.1% |
| 8248 | 1 | < 0.1% |
| 8267 | 1 | < 0.1% |
| 8277 | 1 | < 0.1% |
| 8302 | 1 | < 0.1% |
| 8518 | 1 | < 0.1% |
| 9364 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 7200766 | 1 | < 0.1% |
| 5556399 | 1 | < 0.1% |
| 5545545 | 1 | < 0.1% |
| 2448661 | 1 | < 0.1% |
| 2280980 | 1 | < 0.1% |
| 2139143 | 1 | < 0.1% |
| 2012954 | 1 | < 0.1% |
| 1741243 | 1 | < 0.1% |
| 1728974 | 1 | < 0.1% |
| 1661567 | 1 | < 0.1% |
person_emp_exp
Real number (ℝ)
High correlation  Zeros 
| Distinct | 63 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4103333 |
| Minimum | 0 |
| Maximum | 125 |
| Zeros | 9566 |
| Zeros (%) | 21.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 17 |
| Maximum | 125 |
| Range | 125 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.0635321 |
|---|---|
| Coefficient of variation (CV) | 1.1207317 |
| Kurtosis | 19.168324 |
| Mean | 5.4103333 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.5949174 |
| Sum | 243465 |
| Variance | 36.766421 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9566 | 21.3% |
| 2 | 4134 | 9.2% |
| 1 | 4061 | 9.0% |
| 3 | 3890 | 8.6% |
| 4 | 3524 | 7.8% |
| 5 | 3000 | 6.7% |
| 6 | 2717 | 6.0% |
| 7 | 2204 | 4.9% |
| 8 | 1890 | 4.2% |
| 9 | 1575 | 3.5% |
| Other values (53) | 8439 | 18.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9566 | 21.3% |
| 1 | 4061 | 9.0% |
| 2 | 4134 | 9.2% |
| 3 | 3890 | 8.6% |
| 4 | 3524 | 7.8% |
| 5 | 3000 | 6.7% |
| 6 | 2717 | 6.0% |
| 7 | 2204 | 4.9% |
| 8 | 1890 | 4.2% |
| 9 | 1575 | 3.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 125 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 101 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 93 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
person_home_ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| RENT | 23443 |
|---|---|
| MORTGAGE | 18489 |
| OWN | 2951 |
| OTHER | 117 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.5804889 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | OWN |
| 3rd row | MORTGAGE |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| RENT | 23443 | 52.1% |
| MORTGAGE | 18489 | 41.1% |
| OWN | 2951 | 6.6% |
| OTHER | 117 | 0.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| rent | 23443 | 52.1% |
| mortgage | 18489 | 41.1% |
| own | 2951 | 6.6% |
| other | 117 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| R | 42049 | 16.7% |
| E | 42049 | 16.7% |
| T | 42049 | 16.7% |
| G | 36978 | 14.7% |
| N | 26394 | 10.5% |
| O | 21557 | 8.6% |
| M | 18489 | 7.4% |
| A | 18489 | 7.4% |
| W | 2951 | 1.2% |
| H | 117 | < 0.1% |
loan_amnt
Real number (ℝ)
High correlation 
| Distinct | 4483 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9583.1576 |
| Minimum | 500 |
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2000 |
| Q1 | 5000 |
| median | 8000 |
| Q3 | 12237.25 |
| 95-th percentile | 24000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 7237.25 |
Descriptive statistics
| Standard deviation | 6314.8867 |
|---|---|
| Coefficient of variation (CV) | 0.65895678 |
| Kurtosis | 1.3512152 |
| Mean | 9583.1576 |
| Median Absolute Deviation (MAD) | 3800 |
| Skewness | 1.1797313 |
| Sum | 4.3124209 × 10⁸ |
| Variance | 39877794 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10000 | 3617 | 8.0% |
| 5000 | 2787 | 6.2% |
| 6000 | 2426 | 5.4% |
| 12000 | 2416 | 5.4% |
| 15000 | 2004 | 4.5% |
| 8000 | 1928 | 4.3% |
| 4000 | 1406 | 3.1% |
| 20000 | 1385 | 3.1% |
| 3000 | 1378 | 3.1% |
| 7000 | 1314 | 2.9% |
| Other values (4473) | 24339 | 54.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 500 | 5 | < 0.1% |
| 563 | 1 | < 0.1% |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 900 | 2 | < 0.1% |
| 912 | 1 | < 0.1% |
| 922 | 1 | < 0.1% |
| 950 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 35000 | 234 | 0.5% |
| 34826 | 1 | < 0.1% |
| 34800 | 1 | < 0.1% |
| 34664 | 1 | < 0.1% |
| 34375 | 1 | < 0.1% |
| 34322 | 1 | < 0.1% |
| 34121 | 1 | < 0.1% |
| 34000 | 4 | < 0.1% |
| 33950 | 2 | < 0.1% |
| 33800 | 1 | < 0.1% |
loan_intent
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| EDUCATION | 9153 |
|---|---|
| MEDICAL | 8548 |
| VENTURE | 7819 |
| PERSONAL | 7552 |
| DEBTCONSOLIDATION | 7145 |
| HOMEIMPROVEMENT | 4783 |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 10.012711 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PERSONAL |
|---|---|
| 2nd row | EDUCATION |
| 3rd row | MEDICAL |
| 4th row | MEDICAL |
| 5th row | MEDICAL |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| EDUCATION | 9153 | 20.3% |
| MEDICAL | 8548 | 19.0% |
| VENTURE | 7819 | 17.4% |
| PERSONAL | 7552 | 16.8% |
| DEBTCONSOLIDATION | 7145 | 15.9% |
| HOMEIMPROVEMENT | 4783 | 10.6% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| education | 9153 | 20.3% |
| medical | 8548 | 19.0% |
| venture | 7819 | 17.4% |
| personal | 7552 | 16.8% |
| debtconsolidation | 7145 | 15.9% |
| homeimprovement | 4783 | 10.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| E | 62385 | 13.8% |
| O | 47706 | 10.6% |
| N | 43597 | 9.7% |
| I | 36774 | 8.2% |
| T | 36045 | 8.0% |
| A | 32398 | 7.2% |
| D | 31991 | 7.1% |
| C | 24846 | 5.5% |
| L | 23245 | 5.2% |
| M | 22897 | 5.1% |
| Other values (7) | 88688 | 19.7% |
loan_int_rate
Real number (ℝ)
| Distinct | 1302 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.006606 |
| Minimum | 5.42 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 5.42 |
|---|---|
| 5-th percentile | 6.17 |
| Q1 | 8.59 |
| median | 11.01 |
| Q3 | 12.99 |
| 95-th percentile | 16 |
| Maximum | 20 |
| Range | 14.58 |
| Interquartile range (IQR) | 4.4 |
Descriptive statistics
| Standard deviation | 2.9788083 |
|---|---|
| Coefficient of variation (CV) | 0.27063823 |
| Kurtosis | -0.42033531 |
| Mean | 11.006606 |
| Median Absolute Deviation (MAD) | 2.13 |
| Skewness | 0.21378407 |
| Sum | 495297.26 |
| Variance | 8.8732988 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 11.01 | 3329 | 7.4% |
| 10.99 | 804 | 1.8% |
| 7.51 | 798 | 1.8% |
| 7.49 | 687 | 1.5% |
| 7.88 | 673 | 1.5% |
| 5.42 | 608 | 1.4% |
| 7.9 | 606 | 1.3% |
| 11.49 | 514 | 1.1% |
| 9.99 | 484 | 1.1% |
| 13.49 | 475 | 1.1% |
| Other values (1292) | 36022 | 80.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5.42 | 608 | 1.4% |
| 5.43 | 2 | < 0.1% |
| 5.44 | 2 | < 0.1% |
| 5.46 | 1 | < 0.1% |
| 5.47 | 5 | < 0.1% |
| 5.48 | 4 | < 0.1% |
| 5.49 | 4 | < 0.1% |
| 5.5 | 1 | < 0.1% |
| 5.51 | 3 | < 0.1% |
| 5.52 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 84 | 0.2% |
| 19.91 | 9 | < 0.1% |
| 19.9 | 1 | < 0.1% |
| 19.82 | 5 | < 0.1% |
| 19.8 | 1 | < 0.1% |
| 19.79 | 4 | < 0.1% |
| 19.74 | 4 | < 0.1% |
| 19.69 | 12 | < 0.1% |
| 19.66 | 3 | < 0.1% |
| 19.62 | 1 | < 0.1% |
loan_percent_income
Real number (ℝ)
High correlation 
| Distinct | 64 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13972489 |
| Minimum | 0 |
| Maximum | 0.66 |
| Zeros | 27 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.03 |
| Q1 | 0.07 |
| median | 0.12 |
| Q3 | 0.19 |
| 95-th percentile | 0.31 |
| Maximum | 0.66 |
| Range | 0.66 |
| Interquartile range (IQR) | 0.12 |
Descriptive statistics
| Standard deviation | 0.087212308 |
|---|---|
| Coefficient of variation (CV) | 0.6241716 |
| Kurtosis | 1.0824162 |
| Mean | 0.13972489 |
| Median Absolute Deviation (MAD) | 0.05 |
| Skewness | 1.0345122 |
| Sum | 6287.62 |
| Variance | 0.0076059867 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.08 | 2593 | 5.8% |
| 0.1 | 2421 | 5.4% |
| 0.07 | 2415 | 5.4% |
| 0.09 | 2295 | 5.1% |
| 0.06 | 2242 | 5.0% |
| 0.12 | 2216 | 4.9% |
| 0.05 | 2176 | 4.8% |
| 0.11 | 2158 | 4.8% |
| 0.14 | 1960 | 4.4% |
| 0.04 | 1950 | 4.3% |
| Other values (54) | 22574 | 50.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 27 | 0.1% |
| 0.01 | 315 | 0.7% |
| 0.02 | 944 | 2.1% |
| 0.03 | 1488 | 3.3% |
| 0.04 | 1950 | 4.3% |
| 0.05 | 2176 | 4.8% |
| 0.06 | 2242 | 5.0% |
| 0.07 | 2415 | 5.4% |
| 0.08 | 2593 | 5.8% |
| 0.09 | 2295 | 5.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.66 | 1 | < 0.1% |
| 0.63 | 1 | < 0.1% |
| 0.62 | 2 | < 0.1% |
| 0.61 | 2 | < 0.1% |
| 0.59 | 1 | < 0.1% |
| 0.58 | 1 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| 0.56 | 5 | < 0.1% |
| 0.55 | 5 | < 0.1% |
| 0.54 | 8 | < 0.1% |
cb_person_cred_hist_length
Real number (ℝ)
High correlation 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8674889 |
| Minimum | 2 |
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 14 |
| Maximum | 30 |
| Range | 28 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.8797018 |
|---|---|
| Coefficient of variation (CV) | 0.66122014 |
| Kurtosis | 3.7259445 |
| Mean | 5.8674889 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.63172 |
| Sum | 264037 |
| Variance | 15.052086 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 8653 | 19.2% |
| 3 | 8312 | 18.5% |
| 2 | 6537 | 14.5% |
| 5 | 3082 | 6.8% |
| 6 | 2966 | 6.6% |
| 7 | 2889 | 6.4% |
| 8 | 2800 | 6.2% |
| 9 | 2685 | 6.0% |
| 10 | 2457 | 5.5% |
| 12 | 715 | 1.6% |
| Other values (19) | 3904 | 8.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 6537 | 14.5% |
| 3 | 8312 | 18.5% |
| 4 | 8653 | 19.2% |
| 5 | 3082 | 6.8% |
| 6 | 2966 | 6.6% |
| 7 | 2889 | 6.4% |
| 8 | 2800 | 6.2% |
| 9 | 2685 | 6.0% |
| 10 | 2457 | 5.5% |
| 11 | 712 | 1.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 30 | 23 | 0.1% |
| 29 | 15 | < 0.1% |
| 28 | 29 | 0.1% |
| 27 | 23 | 0.1% |
| 26 | 20 | < 0.1% |
| 25 | 23 | 0.1% |
| 24 | 34 | 0.1% |
| 23 | 26 | 0.1% |
| 22 | 32 | 0.1% |
| 21 | 24 | 0.1% |
credit_score
Real number (ℝ)
| Distinct | 340 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 632.60876 |
| Minimum | 390 |
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 351.7 KiB |
Quantile statistics
| Minimum | 390 |
|---|---|
| 5-th percentile | 539 |
| Q1 | 601 |
| median | 640 |
| Q3 | 670 |
| 95-th percentile | 703 |
| Maximum | 850 |
| Range | 460 |
| Interquartile range (IQR) | 69 |
Descriptive statistics
| Standard deviation | 50.435865 |
|---|---|
| Coefficient of variation (CV) | 0.079726789 |
| Kurtosis | 0.20302186 |
| Mean | 632.60876 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | -0.61026083 |
| Sum | 28467394 |
| Variance | 2543.7765 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 658 | 406 | 0.9% |
| 649 | 398 | 0.9% |
| 652 | 396 | 0.9% |
| 663 | 394 | 0.9% |
| 647 | 393 | 0.9% |
| 650 | 391 | 0.9% |
| 654 | 391 | 0.9% |
| 667 | 390 | 0.9% |
| 653 | 390 | 0.9% |
| 656 | 386 | 0.9% |
| Other values (330) | 41065 | 91.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 390 | 1 | < 0.1% |
| 418 | 1 | < 0.1% |
| 419 | 1 | < 0.1% |
| 420 | 1 | < 0.1% |
| 421 | 1 | < 0.1% |
| 430 | 1 | < 0.1% |
| 431 | 2 | < 0.1% |
| 434 | 1 | < 0.1% |
| 435 | 4 | < 0.1% |
| 437 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 850 | 1 | < 0.1% |
| 807 | 1 | < 0.1% |
| 805 | 1 | < 0.1% |
| 792 | 1 | < 0.1% |
| 789 | 1 | < 0.1% |
| 784 | 2 | < 0.1% |
| 773 | 1 | < 0.1% |
| 772 | 1 | < 0.1% |
| 770 | 1 | < 0.1% |
| 768 | 1 | < 0.1% |
previous_loan_defaults_on_file
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| True | 22858 | 50.8% |
| False | 22142 | 49.2% |
loan_status
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
| 0 | 35000 |
|---|---|
| 1 | 10000 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35000 | 77.8% |
| 1 | 10000 | 22.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35000 | 77.8% |
| 1 | 10000 | 22.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35000 | 77.8% |
| 1 | 10000 | 22.2% |
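The loan_status counts show an imbalanced target: roughly one positive for every 3.5 negatives. The sketch below derives the class shares and the imbalance ratio from the two counts reported above. The variable names are illustrative only.

```python
# Counts copied from the loan_status "Common Values" table.
counts = {0: 35_000, 1: 10_000}

total = sum(counts.values())
shares = {label: n / total for label, n in counts.items()}

# Ratio of the majority class to the minority class.
imbalance_ratio = max(counts.values()) / min(counts.values())

for label, share in shares.items():
    print(f"loan_status={label}: {share:.1%}")
print(f"imbalance ratio: {imbalance_ratio:.1f} : 1")
```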
Interactions
Correlations
| | cb_person_cred_hist_length | credit_score | loan_amnt | loan_int_rate | loan_intent | loan_percent_income | loan_status | person_age | person_education | person_emp_exp | person_gender | person_home_ownership | person_income | previous_loan_defaults_on_file |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cb_person_cred_hist_length | 1.000 | 0.142 | 0.043 | 0.017 | 0.054 | -0.037 | 0.020 | 0.821 | 0.091 | 0.750 | 0.026 | 0.028 | 0.093 | 0.026 |
| credit_score | 0.142 | 1.000 | 0.006 | 0.011 | 0.016 | -0.012 | 0.008 | 0.160 | 0.129 | 0.172 | 0.005 | 0.000 | 0.023 | 0.178 |
| loan_amnt | 0.043 | 0.006 | 1.000 | 0.105 | 0.030 | 0.666 | 0.126 | 0.064 | 0.000 | 0.052 | 0.005 | 0.090 | 0.405 | 0.066 |
| loan_int_rate | 0.017 | 0.011 | 0.105 | 1.000 | 0.017 | 0.124 | 0.363 | 0.013 | 0.004 | 0.016 | 0.000 | 0.084 | -0.033 | 0.198 |
| loan_intent | 0.054 | 0.016 | 0.030 | 0.017 | 1.000 | 0.018 | 0.142 | 0.030 | 0.012 | 0.029 | 0.000 | 0.082 | 0.010 | 0.080 |
| loan_percent_income | -0.037 | -0.012 | 0.666 | 0.124 | 0.018 | 1.000 | 0.415 | -0.056 | 0.000 | -0.050 | 0.000 | 0.091 | -0.353 | 0.220 |
| loan_status | 0.020 | 0.008 | 0.126 | 0.363 | 0.142 | 0.415 | 1.000 | 0.012 | 0.000 | 0.014 | 0.000 | 0.258 | 0.009 | 0.543 |
| person_age | 0.821 | 0.160 | 0.064 | 0.013 | 0.030 | -0.056 | 0.012 | 1.000 | 0.060 | 0.888 | 0.024 | 0.015 | 0.143 | 0.030 |
| person_education | 0.091 | 0.129 | 0.000 | 0.004 | 0.012 | 0.000 | 0.000 | 0.060 | 1.000 | 0.065 | 0.000 | 0.006 | 0.004 | 0.040 |
| person_emp_exp | 0.750 | 0.172 | 0.052 | 0.016 | 0.029 | -0.050 | 0.014 | 0.888 | 0.065 | 1.000 | 0.021 | 0.009 | 0.120 | 0.028 |
| person_gender | 0.026 | 0.005 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.024 | 0.000 | 0.021 | 1.000 | 0.000 | 0.009 | 0.000 |
| person_home_ownership | 0.028 | 0.000 | 0.090 | 0.084 | 0.082 | 0.091 | 0.258 | 0.015 | 0.006 | 0.009 | 0.000 | 1.000 | 0.008 | 0.140 |
| person_income | 0.093 | 0.023 | 0.405 | -0.033 | 0.010 | -0.353 | 0.009 | 0.143 | 0.004 | 0.120 | 0.009 | 0.008 | 1.000 | 0.008 |
| previous_loan_defaults_on_file | 0.026 | 0.178 | 0.066 | 0.198 | 0.080 | 0.220 | 0.543 | 0.030 | 0.040 | 0.028 | 0.000 | 0.140 | 0.008 | 1.000 |
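The "High correlation" alerts in the overview can be traced back to this matrix by filtering on the absolute coefficient. The sketch below assumes a cut-off of 0.5. That threshold is an assumption, though it is consistent with the alerts listed: 0.543 is flagged and 0.415 is not. Only upper-triangle entries with an absolute value of at least 0.3 are copied in, to keep the example short.

```python
THRESHOLD = 0.5  # assumption: cut-off behind the "High correlation" alerts

# Upper-triangle entries copied from the matrix (only |r| >= 0.3 kept).
corr = {
    ("cb_person_cred_hist_length", "person_age"): 0.821,
    ("cb_person_cred_hist_length", "person_emp_exp"): 0.750,
    ("loan_amnt", "loan_percent_income"): 0.666,
    ("loan_amnt", "person_income"): 0.405,
    ("loan_int_rate", "loan_status"): 0.363,
    ("loan_percent_income", "loan_status"): 0.415,
    ("loan_percent_income", "person_income"): -0.353,
    ("loan_status", "previous_loan_defaults_on_file"): 0.543,
    ("person_age", "person_emp_exp"): 0.888,
}

# Keep pairs whose absolute coefficient reaches the threshold.
high = sorted(pair for pair, r in corr.items() if abs(r) >= THRESHOLD)

for a, b in high:
    print(f"{a} <-> {b}: {corr[(a, b)]:+.3f}")
```

The five pairs that pass the filter are the same ones named in the alerts, where each pair appears once per variable.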
Missing values
Sample
First rows
| | person_age | person_gender | person_education | person_income | person_emp_exp | person_home_ownership | loan_amnt | loan_intent | loan_int_rate | loan_percent_income | cb_person_cred_hist_length | credit_score | previous_loan_defaults_on_file | loan_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 22.0 | female | Master | 71948.0 | 0 | RENT | 35000.0 | PERSONAL | 16.02 | 0.49 | 3.0 | 561 | No | 1 |
| 1 | 21.0 | female | High School | 12282.0 | 0 | OWN | 1000.0 | EDUCATION | 11.14 | 0.08 | 2.0 | 504 | Yes | 0 |
| 2 | 25.0 | female | High School | 12438.0 | 3 | MORTGAGE | 5500.0 | MEDICAL | 12.87 | 0.44 | 3.0 | 635 | No | 1 |
| 3 | 23.0 | female | Bachelor | 79753.0 | 0 | RENT | 35000.0 | MEDICAL | 15.23 | 0.44 | 2.0 | 675 | No | 1 |
| 4 | 24.0 | male | Master | 66135.0 | 1 | RENT | 35000.0 | MEDICAL | 14.27 | 0.53 | 4.0 | 586 | No | 1 |
| 5 | 21.0 | female | High School | 12951.0 | 0 | OWN | 2500.0 | VENTURE | 7.14 | 0.19 | 2.0 | 532 | No | 1 |
| 6 | 26.0 | female | Bachelor | 93471.0 | 1 | RENT | 35000.0 | EDUCATION | 12.42 | 0.37 | 3.0 | 701 | No | 1 |
| 7 | 24.0 | female | High School | 95550.0 | 5 | RENT | 35000.0 | MEDICAL | 11.11 | 0.37 | 4.0 | 585 | No | 1 |
| 8 | 24.0 | female | Associate | 100684.0 | 3 | RENT | 35000.0 | PERSONAL | 8.90 | 0.35 | 2.0 | 544 | No | 1 |
| 9 | 21.0 | female | High School | 12739.0 | 0 | OWN | 1600.0 | VENTURE | 14.74 | 0.13 | 3.0 | 640 | No | 1 |
Last rows
| | person_age | person_gender | person_education | person_income | person_emp_exp | person_home_ownership | loan_amnt | loan_intent | loan_int_rate | loan_percent_income | cb_person_cred_hist_length | credit_score | previous_loan_defaults_on_file | loan_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 44990 | 31.0 | male | Master | 136832.0 | 9 | RENT | 12319.0 | PERSONAL | 16.92 | 0.09 | 7.0 | 722 | No | 1 |
| 44991 | 24.0 | male | High School | 37786.0 | 0 | MORTGAGE | 13500.0 | EDUCATION | 13.43 | 0.36 | 4.0 | 612 | No | 1 |
| 44992 | 23.0 | female | Bachelor | 40925.0 | 0 | RENT | 9000.0 | PERSONAL | 11.01 | 0.22 | 4.0 | 487 | No | 1 |
| 44993 | 27.0 | female | High School | 35512.0 | 4 | RENT | 5000.0 | PERSONAL | 15.83 | 0.14 | 5.0 | 505 | No | 1 |
| 44994 | 24.0 | female | Associate | 31924.0 | 2 | RENT | 12229.0 | MEDICAL | 10.70 | 0.38 | 4.0 | 678 | No | 1 |
| 44995 | 27.0 | male | Associate | 47971.0 | 6 | RENT | 15000.0 | MEDICAL | 15.66 | 0.31 | 3.0 | 645 | No | 1 |
| 44996 | 37.0 | female | Associate | 65800.0 | 17 | RENT | 9000.0 | HOMEIMPROVEMENT | 14.07 | 0.14 | 11.0 | 621 | No | 1 |
| 44997 | 33.0 | male | Associate | 56942.0 | 7 | RENT | 2771.0 | DEBTCONSOLIDATION | 10.02 | 0.05 | 10.0 | 668 | No | 1 |
| 44998 | 29.0 | male | Bachelor | 33164.0 | 4 | RENT | 12000.0 | EDUCATION | 13.23 | 0.36 | 6.0 | 604 | No | 1 |
| 44999 | 24.0 | male | High School | 51609.0 | 1 | RENT | 6665.0 | DEBTCONSOLIDATION | 17.05 | 0.13 | 3.0 | 628 | No | 1 |
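The sample rows suggest that loan_percent_income is simply loan_amnt divided by person_income, rounded to two decimals. That would explain the high correlation reported between loan_amnt and loan_percent_income. This relationship is an inference from the sample, not something the report states. A quick check on the first five rows shown:

```python
# (loan_amnt, person_income, reported loan_percent_income) for rows 0 to 4.
rows = [
    (35000.0, 71948.0, 0.49),
    (1000.0, 12282.0, 0.08),
    (5500.0, 12438.0, 0.44),
    (35000.0, 79753.0, 0.44),
    (35000.0, 66135.0, 0.53),
]

# Collect rows where the recomputed ratio disagrees with the reported value.
mismatches = [
    (amnt, income, reported)
    for amnt, income, reported in rows
    if round(amnt / income, 2) != reported
]

print(f"{len(rows) - len(mismatches)} of {len(rows)} rows match")
```

If the relationship holds across the whole dataset, loan_percent_income is a derived feature and carries no information beyond the two columns it is computed from.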